255 results found.
Speech/Written
Corpus,
Language Type:
Monolingual
Languages:
Mandarin Chinese
Availability:
Freely Available
License:
Apache License v.2.0
Size:
19 GByteProduction Status:
Newly created-finished
Use:
Corpus Creation/Annotation
-
Paper title:AISHELL-3: A Multi-Speaker Mandarin TTS Corpus
-
Paper track:7.13 Tools and data for speech synthesis/Poster Presentation
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Yao Shi | AISHELL-3 | /N |
Documentation:
documentation in English, open access.
Speech
Corpus,
Language Type:
Monolingual
Languages:
Mandarin Chinese
Availability:
From Owner
License:
None
Size:
19877 wordsProduction Status:
Newly created-finished
Use:
Acquisition
-
Paper title:Segment and Tone Production in Continuous Speech of Hearing and Hearing-impaired Children
-
Paper track:13.3 Hearing disorders/Oral Presentation
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Shu-Chuan Tseng | Sinica Child Speech Corpus | /N |
Documentation:
Yes
Corpus,
Language Type:
Monolingual
Languages:
Mandarin Chinese
Availability:
License:
Size:
None Production Status:
Use:
-
Paper title:Relaxing the Conditional Independence Assumption of CTC-based ASR by Conditioning on Intermediate Predictions
-
Paper track:14.12 Non-Autoregressive Sequential Modeling for S/Poster Presentation
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Jumon Nozaki | AISHELL-1 | /N |
Documentation:
NoneLanguage Type:
Trilingual
Languages:
Egyptian Arabic English Mandarin Chinese
Availability:
The Data Will Be Published Via LDC General Catalogue
License:
<Not Specified>
Size:
2709094 words Production Status:
Newly created-finished
Use:
Parsing and Tagging
-
Paper title:Large Multi-lingual, Multi-level and Multi-genre Annotation Corpus
-
Paper track:Written
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Xuansong Li | Linguistic Data Consortium, University of Pennsylvania | US | ||
| Author 2 | Martha Palmer | Department of Linguistics and Computer Science, University of Colorado | US | ||
| Author 3 | Nianwen Xue | Computer Science Department, Brandeis University | US | ||
| Author 4 | Lance Ramshaw | Raytheon BBN Technologies | US | ||
| Author 5 | Mohamed Maamouri | <Not Specified> | None | Linguistic Data Consortium, University of Pennsylvania | US |
| Author 6 | Ann Bies | <Not Specified> | None | Linguistic Data Consortium, University of Pennsylvania | US |
| Author 7 | Kathryn Conger | Department of Linguistics and Computer Science, University of Colorado | US | ||
| Author 8 | Stephen Grimes | Linguistic Data Consortium, University of Pennsylvania | US | ||
| Author 9 | Stephanie Strassel | Linguistic Data Consortium, University of Pennsylvania | US | ||
| Main Contact | Xuansong Li | Linguistic Data Consortium, University of Pennsylvania | None |
Documentation:
<Not Specified>
Treebank,
Language Type:
Monolingual
Languages:
Mandarin Chinese
Availability:
License:
Size:
None Production Status:
Use:
-
Paper title:Multilingual Constituency Parsing with Self-Attention and Pre-Training
-
Paper track:Short/Tagging, Chunking, Syntax and Parsing
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Nikita Kitaev | Chinese Treebank 5.1 | /N |
Documentation:
None
Written
Corpus,
Language Type:
Multilingual
Languages:
English Japanese Mandarin Chinese
Availability:
Freely Available
License:
http://lotus.kuee.kyoto-u.ac.jp/ASPEC/#agreement.html
Size:
None Production Status:
Use:
Machine Translation, SpeechToSpeech Translation
-
Paper title:Bilingual Subword Segmentation for Neural Machine Translation
-
Paper track:Long paper/
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Hiroyuki Deguchi | Asian Scientific Paper Excerpt Corpus | /N |
Documentation:
None
Multimodal/Multimedia
Corpus,
Language Type:
Bilingual
Languages:
English Mandarin Chinese
Availability:
License:
LDC
Size:
None Production Status:
Newly created-in progress
Use:
Speech Recognition/Understanding
-
Paper title:On the Role of Style in Parsing Speech with Neural Models
-
Paper track:12.10 Metadata for ling./discourse structure (disf/Poster Presentation
-
Paper status:Accept - Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Trang Tran | GlobalTIMIT | /N |
Documentation:
None
Speech
Corpus,
Language Type:
Monolingual
Languages:
Arabic Bengali Central Khmer Chinese Dari Egyptian Arabic English Georgian Hindi Iranian Persian Italian Japanese Korean Lao Mandarin Chinese Min Nan Chinese Moroccan Arabic Northern Khmer Panjabi Persian Russian Spanish Tagalog Thai Tigrinya Urdu Uzbek Vietnamese Wu Chinese Yue Chinese
Availability:
From Data Center(s)
License:
LDC
Size:
None Production Status:
Existing-used
Use:
Speech Recognition/Understanding
-
Paper title:End-to-End Neural Speaker Diarization with Permutation-Free Objectives
-
Paper track:4.5 Speaker diarization/Poster Presentation
-
Paper status:Accept - Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Yusuke Fujita | 2008 NIST Speaker Recognition Evaluation | /N |
Documentation:
None
Speech
Corpus,
Language Type:
Multilingual
Languages:
Arabic English Mandarin Chinese Russian Spanish
Availability:
From Data Center(s)
License:
LDC
Size:
392 hours Production Status:
Existing-used
Use:
Speech Recognition/Understanding
-
Paper title:End-to-End Neural Speaker Diarization with Permutation-Free Objectives
-
Paper track:4.5 Speaker diarization/Poster Presentation
-
Paper status:Accept - Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Yusuke Fujita | 2005 NIST Speaker Recognition Evaluation Training Data | /N |
Documentation:
None
Speech
Corpus,
Language Type:
Monolingual
Languages:
Mandarin Chinese
Availability:
Not Available
License:
Size:
85 hours Production Status:
Newly created-finished
Use:
Machine Learning
-
Paper title:Deep Learning based Mandarin Accent Identification for Accent Robust ASR
-
Paper track:3.4 Automatic analysis of speaker traits/Poster Presentation
-
Paper status:Accept - Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Felix Weninger | Mandarin Accent Database | /N |
Documentation:
None




